Search CORE

72 research outputs found

Demystifying Unsupervised Semantic Correspondence Estimation

Author: Aygun Mehmet
Mac Aodha Oisin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 23/10/2022
Field of study

Unsupervised Monocular Depth Estimation with Left-Right Consistency

Author: Brostow Gabriel J.
Godard Clément
Mac Aodha Oisin
Publication venue
Publication date: 12/04/2017
Field of study

Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we innovate beyond existing approaches, replacing the use of explicit depth data during training with easier-to-obtain binocular stereo footage. We propose a novel training objective that enables our convolutional neural network to learn to perform single image depth estimation, despite the absence of ground truth depth data. Exploiting epipolar geometry constraints, we generate disparity images by training our network with an image reconstruction loss. We show that solving for image reconstruction alone results in poor quality depth images. To overcome this problem, we propose a novel training loss that enforces consistency between the disparities produced relative to both the left and right images, leading to improved performance and robustness compared to existing approaches. Our method produces state of the art results for monocular depth estimation on the KITTI driving dataset, even outperforming supervised methods that have been trained with ground truth depth.Comment: CVPR 2017 ora

arXiv.org e-Print Archive

Crossref

UCL Discovery

Incremental Generalized Category Discovery

Author: Mac Aodha Oisin
Zhao Bingchen
Publication venue
Publication date: 15/01/2024
Field of study

We explore the problem of Incremental Generalized Category Discovery (IGCD). This is a challenging category incremental learning setting where the goal is to develop models that can correctly categorize images from previously seen categories, in addition to discovering novel ones. Learning is performed over a series of time steps where the model obtains new labeled and unlabeled data, and discards old data, at each iteration. The difficulty of the problem is compounded in our generalized setting as the unlabeled data can contain images from categories that may or may not have been observed before. We present a new method for IGCD which combines non-parametric categorization with efficient image sampling to mitigate catastrophic forgetting. To quantify performance, we propose a new benchmark dataset named iNatIGCD that is motivated by a real-world fine-grained visual categorization task. In our experiments we outperform existing related methods

Edinburgh Research Explorer

Context Embedding Networks

Author: Kim Kun Ho
Mac Aodha Oisin
Perona Pietro
Publication venue
Publication date: 29/03/2018
Field of study

Low dimensional embeddings that capture the main variations of interest in collections of data are important for many applications. One way to construct these embeddings is to acquire estimates of similarity from the crowd. However, similarity is a multi-dimensional concept that varies from individual to individual. Existing models for learning embeddings from the crowd typically make simplifying assumptions such as all individuals estimate similarity using the same criteria, the list of criteria is known in advance, or that the crowd workers are not influenced by the data that they see. To overcome these limitations we introduce Context Embedding Networks (CENs). In addition to learning interpretable embeddings from images, CENs also model worker biases for different attributes along with the visual context i.e. the visual attributes highlighted by a set of images. Experiments on two noisy crowd annotated datasets show that modeling both worker bias and visual context results in more interpretable embeddings compared to existing approaches.Comment: CVPR 2018 spotligh

arXiv.org e-Print Archive

Caltech Authors

Incremental Generalized Category Discovery

Author: Mac Aodha Oisin
Zhao Bingchen
Publication venue
Publication date: 17/08/2023
Field of study

arXiv.org e-Print Archive

Becoming the Expert - Interactive Multi-Class Machine Teaching

Author: Brostow Gabriel J.
Johns Edward
Mac Aodha Oisin
Publication venue
Publication date: 01/03/2015
Field of study

Compared to machines, humans are extremely good at classifying images into categories, especially when they possess prior knowledge of the categories at hand. If this prior information is not available, supervision in the form of teaching images is required. To learn categories more quickly, people should see important and representative images first, followed by less important images later - or not at all. However, image-importance is individual-specific, i.e. a teaching image is important to a student if it changes their overall ability to discriminate between classes. Further, students keep learning, so while image-importance depends on their current knowledge, it also varies with time. In this work we propose an Interactive Machine Teaching algorithm that enables a computer to teach challenging visual concepts to a human. Our adaptive algorithm chooses, online, which labeled images from a teaching set should be shown to the student as they learn. We show that a teaching strategy that probabilistically models the student's ability and progress, based on their correct and incorrect answers, produces better 'experts'. We present results using real human participants across several varied and challenging real-world datasets.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref

Spiral - Imperial College Digital Repository

ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation

Author: Bilen Hakan
Mac Aodha Oisin
Mariotti Octave
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/02/2022
Field of study

Edinburgh Research Explorer

ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields

Author: Bilen Hakan
Mac Aodha Oisin
Mariotti Octave
Publication venue
Publication date: 25/11/2022
Field of study

Edinburgh Research Explorer

Visual Knowledge Tracing

Author: Kondapaneni Neehar
Mac Aodha Oisin
Perona Pietro
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/10/2022
Field of study

Edinburgh Research Explorer